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TECHNICAL FIELD: 

5 

These teachings relate generally to data processor hardware and, more specifically, relate to register 
files and to circuit constructions suitable for use in high performance register files containing 
bitlines and wordlines. 

10 BACKGROUND: 

Register files are performance-critical memory components that typically can be found in general 
purpose microprocessors and other types of digital data processors. A register file is typically 
required to meet the following constraints: 1) exhibit a single clock cycle read/write latency that can 
15 support back-to-back read and write operations; and 2) provide multiple read/write port capability 
to enable the simultaneous access by several execution units in a super-scalar architecture. These 
requirements, coupled with the demand for a large number of word entries per port, have 
traditionally necessitated the use of wire-OR type dynamic circuits for the local and global bitlines 
(i.e., for those circuit paths that convey the input and output data bits). 

20 

In accordance with CMOS technology scaling, and in order to achieve high performance, the supply 
voltage V dd and threshold voltage V t are both scaled to maintain approximately the same V dd /V t 
ratio. However, aggressive V t scaling results in an exponential increase in bitline active leakage 
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currents, and also results in a poor bitline noise immunity scaling trend. Therefore, alternate bitline 
circuit techniques that curtail the poor bitline noise immunity scaling trend are required in order to 
achieve high noise immunity while sustaining high performance. 

5 Previous techniques have involved the use of negative wordline drivers, dynamic threshold voltage 
adjustment via substrate/well bias control, and pseudo-static bitlines. 

Figure 1 shows the organization of a conventional 4-read, 2-write ported 256-word x 40-bit/word 
register file 1 . The register file 1 contains four read address decoder circuit sections 2, two write 

10 address decoder circuit sections 3, and a 40-bit register file array 4 arranged as a 40 slice bitline 
stack. A complete read operation is performed in two clock cycles. An 8-bit read/write address per 
port is decoded in section 2 in the first cycle to deliver the read/write select signals into the register 
file array 4. The decoder 2 is non-critical, and therefore can be implemented in conventional static 
CMOS circuitry. In the next cycle, which is critical in terms of performance, the actual bitline read 

15 operation is conducted. Figure 2 shows one bit slice for one read-port path, while Figure 3 shows 
the four full-swing local bitlines. The four local bitlines are totally independent of each other, 
sharing only the bitcells. Each local bitline (LBL) supports 16 bitcells and a two-way merge via a 
static NMOS gate that drives a global bitline (GBL). A bitcell has two-write-ports and 
four-read-ports. Both reading and writing are single ended. 

20 

With regard now to the use of dual-V t dynamic bitlines, the LBL and GBL dynamic ORs are 
susceptible to noise due to high active leakage during evaluation when the precharged domino node 
should stay high. LBL is particularly more sensitive than GBL due to a small domino node stored 
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charge and a wider dynamic OR structure. 



Figure 4 shows a worst-case bitline noise scenario in which all low-V t transistors (LVTs) are used 
to maximize the performance of the read operation. 

5 

A dual-V t LBL uses a high-V t (HVT) on the read-selection transistor and a low-V t (LVT) on the 
bitcell data transistors, as shown in Figure 5. The use of the high-V t transistors limits the bitline 
leakage. However, this benefit is achieved at the cost of degraded performance due to the reduced 
drive currents to the high-V t transistors. 

10 

Figure 6 shows a prior art pseudo-static leakage-tolerant LBL technique. This technique employs 
modifications to the conventional dynamic bitline topology. A first modification is that the 
read-select input and bitcell data locations on the bitline stack are swapped, and the read-select 
signals feed the lower (M2) transistors of the LBL. A second modification is the introduction of 
15 static-precharge transistors (P x ) that are driven actively by the read-select signals. These P x 
transistors anchor the bitline static nodes (V s ) at V dd when the read-selects are at ground potential. 
A third modification is the introduction of static 2-input NOR gates, whose inputs are the bitline 
stack node and bitcell data. The NOR gate outputs drive the upper (Ml) transistors of the LBL. 

20 When the read-select inputs are at GND, the NOR gate outputs force the leakage-limiting Ml 
transistor input to GND. This effectively cuts off the sub-threshold active leakage current path of 
the bitline, since both the drain and the source of the Ml transistor is maximized due to the full V dd 
of the source-body bias, which further elevates the V t . As a result, the bitline noise immunity can 
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be increased. 



However, the benefit of the pseudo-static technique shown in Figure 6 is obtained at the cost of 
degraded performance due to the presence of the additional NOR-gate, and the sub-threshold 
5 leakage through P x and M2. 

SUMMARY OF THE PREFERRED EMBODIMENTS 

The foregoing and other problems are overcome, and other advantages are realized, in accordance 
10 with the presently preferred embodiments of these teachings. 

Based on the foregoing description of the prior art, it can be appreciated that the benefit derived 
from the use of dual-V t bitlines in a register file, such as decreased bitline active leakage currents 
and reduced bitline noise immunity scaling trend, is achieved at the cost of degraded performance 
15 due to a high V t . This invention overcomes these problems by increasing the performance of a 
register file that is constructed to include dual- V t bitlines. The invention employs a boost of the drive 
signal for one of the transistors of a bitline circuit, preferably for the read-selection transistor of a 
local bitline (LBL) circuit. The drive signal amplitude is made greater than the normal supply 
voltage by some increment delta V. 

20 

In one aspect this invention provides a register file supplied by a supply voltage V dd and that 
includes a plurality of threshold voltage V t bitlines. Each of the plurality of V t bitlines includes a 
read-selection transistor having a gate coupled to a dynamic read selection (RS) signal. In 
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accordance with this invention there is a circuit interposed between the RS signal and the gate for 
increasing a level of a drive signal applied to the gate to be greater than V dd . 

In one embodiment the circuit generates the increase in the drive signal using a bootstrap 
5 capacitance, and in another embodiment the increase in the drive signal is achieved by using a 
voltage level shifter circuit that is powered from a supply voltage that exceeds V dd . 

Also disclosed is a method for use in dual-V t bitline circuit that includes a high-V t read-selection 
transistor and a low-V t bitcell data transistor. The method increases the drive current to the high-V t 
1 0 transistor and includes (a) applying a read select (RS) signal; and (b) boosting the maximum voltage 
level of the RS signal so that it exceeds the level of the circuit supply voltage V dd before applying 
the RS signal to the gate of the high-V t read-selection transistor. 

More generally, the teachings of this invention can be used with dual- V t bitlines or single-V t bitlines. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other aspects of these teachings are made more evident in the following Detailed 
Description of the Preferred Embodiments, when read in conjunction with the attached Drawing 
20 Figures, wherein: 

Figure 1 is schematic diagram of a conventional 256x40-bit register file; 
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Figure 2 is a schematic diagram of a conventional one bit slice for a read-port path of the register 
file of Figure 1; 

Figure 3 shows a conventional local bitline arrangement; 

5 

Figure 4 shows a schematic of a conventional local bitline arrangement for worst-case read-select 
input noise uses two low-V t transistors; 

Figure 5 shows a conventional dual- V t local bitline arrangement that uses only one low-V t transistor; 

Figure 6 is a schematic diagram of a conventional pseudo static low-V t Local bitline arrangement; 

Figure 7 is a schematic diagram of a dual-V t local bitline driven by a CMOS bootstrapped circuit 
in accordance with an embodiment of this invention; 

15 

Figure 8 is a schematic diagram of the dual-V t local bitline driven by the CMOS bootstrapped circuit 
in accordance with a second embodiment of this invention, where a boost ratio control function is 
implemented using a plurality of selectable bootstrap circuits; and 

20 Figure 9 is a schematic diagram of a dual-V t local bitline driven by a column of low-to-high level 
shifter circuits in accordance with a further embodiment of this invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



Figure 7 shows a dual-V t local bitline (LBL) 100 in accordance with this invention. C 0 represents 
the load capacitance of a read wordline (RWL) coupled to the gate of M2. Unlike the conventional 
5 dynamic bitline techniques, the high-V t transistor (M2) is turned on by a dynamic signal RWL 
whose maximum voltage level V RWL is greater than that of the supply voltage V dd applied to the 
remainder of the register file circuits. The dynamic signal RWL, with the maximum voltage level 
V RWU is generated internally by a CMOS bootstrap circuit 105. The CMOS bootstrap circuit 105 is 
constructed to include an inverter 107 that drives p-channel FETs 109 and 111, where FET 1 1 1 is 
10 connected to form a parasitic bootstrap capacitance C B . As a result of the operation of the CMOS 
bootstrap circuit 1 05 the V GS of M2 is boosted by an amount delta V (A V) with parasitic capacitance 
C B after outputting a V dd pulse. The boost ratio is defined by: 

AVA^ dd = (C B /(C B +C 0 ))*V dd . (1) 

15 

As an example, V dd may equal 0.9V, and V dd +AV may equal 1.5V. 

Describing the operation of the circuit in further detail, the bootstrap circuit 105 is connected to a 
p-FET 113, which in turn is connected to an n-channel FET 115. The output taken between FETs 
20 113 and 115, which function as an inverter, is coupled to the RWL, and thus to the gate of the HVT 
transistor M2. The RS input signal is applied to the gate of an input n-channel FET 117. When RS 
goes high (the active state of RS in this example) the output of FET 117 goes low. This turns on p- 
FET 113 and turns off n-FET 115. After a short propagation delay (e.g., about a nanosecond) 
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through inverter 107 the low signal at the input to the inverter 107 goes high at the output of the 
inverter 1 07, thereby turning off p-FETs 1 09 and 111. The bootstrap capacitance C B then discharges 
through the p-FET 1 13 (which is turned on), thereby injecting charge into the RWL bus, resulting 
in the AV boost in the signal appearing at the gate of M2. When RS goes low (inactive) the output 
5 of FET 1 1 7 goes high, thereby turning on p-FETs 1 09 and 1 1 1 via inverter 1 07, and recharging C B 
from V dd , via FET 109. When the output of FET 1 17 goes high this also turns off p-FET 1 13 and 
turns on n-FET 115, thereby discharging the RWL capacitance C 0 through FET 1 15 to ground. 

Figure 8 shows the use of a plurality (e.g., three) bootstrap circuits 105A, 105B and 105C coupled 
10 together in parallel that are individually selectable with select (SEL) signals SEL(0), SEL(l) and 
SEL(2), respectively. Note that the bootstrap circuits 105 A, 105B andl05C can be identical except 
for the value of C B (4*C B , 2*C B and C B , respectively). The boost ratio may be defined as follows: 

AV SEU0:2] A^ dd =(C x /(C x +C o ))*V dd , (2) 

15 

where 

[C x =SEL[0] *4*C B +SEL[ 1 ]*2*C B +SEL[2] *C B ], 

20 and where * denotes multiplication. Note that more than one of the selection signals can be active 
at any given time. 



Figure 9 is a schematic diagram of a dual-V t local bitline driven by a low-to-high voltage level 
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shifter circuit 120 in accordance with a further embodiment of this invention. In this embodiment 
the dynamic signal RWL is boosted up by supplying only the voltage level shifter circuit 120 with 
a higher supply voltage (V ddH ) that the normal supply voltage V dd used by the other circuits of the 
register file (e.g., 1.5V versus 0.9V). 

5 

These techniques solve the degraded performance problem of a register file with dual-V t dynamic 
bitlines, as well as the poor bitline noise immunity scaling trend due to active leakage currents. 

Based on the foregoing description of the preferred embodiments it should be appreciated that this 
1 0 invention provides a technique to enhance the performance of the register file with dual- V t dynamic 
bitlines. In the dual-V t bitlines the high-V t transistors (M2) are turned on by a dynamic signal with 
a higher voltage than V dd . The boosted dynamic signal makes the gate-to-source voltage of the 
high-V t transistors greater than the drain-to-source voltage. The dynamic signal is internally 
generated by, in one embodiment, the CMOS bootstrapped circuit 105 or, in another embodiment, 
15 by the column of low-to-high voltage level shifter circuits 120. 

The foregoing description has provided by way of exemplary and non-limiting examples a full and 
informative description of the best method and apparatus presently contemplated by the inventors 
for carrying out the invention. However, various modifications and adaptations may become 
20 apparent to those skilled in the relevant arts in view of the foregoing description, when read in 
conjunction with the accompanying drawings and the appended claims. As but some examples, the 
use of other similar or equivalent types of circuits to boost the drive signal to a higher voltage signal 
may be employed. Similarly, this invention can be used with any type of register file structure with 
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dual-V t or multiple- V t dynamic bitlines. In addition, it should be noted that this invention can be 
used as well with single V t architectures, where the read-selection transistor and the data transistor 
are both either low-V t (LVT) or high-V t (HVT) transistors. Furthermore, while shown in Figures 7, 
8 and 9 to be applied to the local bitlines (LBLs), this invention can be applied as well to global 
5 bitlines (GBLs). However, all such and similar modifications of the teachings of this invention will 
still fall within the scope of this invention. 
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